Disclosure Risk from Factor Scores in a Remote Access Environment
Abstract
Remote access is a promising tool for broadening access to microdata without violating confidentiality requirements. In a remote access setting, the user submits queries to a system provided by the statistical agency, and only the query results are reported back to the user. Since no direct access to the data is granted, the underlying microdata generally need not be altered. Still, remote access carries the risk of disclosing sensitive information even though the actual data are not directly available. Most disclosive queries are easily detected and can be suppressed by the system. However, more complex procedures such as multivariate analyses can also lead to a breach of confidentiality if they are applied in a sophisticated manner that exploits certain features of the data. In this paper we illustrate how an intruder could employ commonly used factor analysis to obtain sensitive information about the underlying microdata. We present the general concept and evaluate the approach using a German establishment survey, the IAB Establishment Panel. We find theoretical and empirical evidence for a high risk of disclosure from factor analysis.